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EP0 742 287 A2 

Description 

The present invention provides probes comprised of nucleotide analogues immobilized in arrays on solid sub- 
strates for analyzing molecular interactions of biological interest, and target nucleic acids comprised of nucleotide ana- 
5 logues. The invention therefore relates to the molecular interaction of polymers immobilized on solid substrates 
including related chemistry, biology, and medical diagnostic uses. 

BACKGROUND OF THE INVENTION 

10 The development of very large scale immobilized polymer synthesis (VLSIPS™) technology provides pioneering 
methods for arranging large numbers of oligonucleotide probes in very small arrays. See, U.S. application, SN 
07/805,727 and PCT patent publication Nos. WO 90/15070 and 92/10092, each of which is incorporated herein by ref- 
erence for all purposes. U.S. Patent application Serial No. 07/082,937, filed June 25, 1993, and incorporated herein for 
all purposes, describes methods for making arrays of oligonucleotide probes that are used, e.g., to determine the com- 

15 plete sequence of a target nucleic acid and/or to detect the presence of a nucleic acid with a specified sequence. 

VLSIPS™ technology provides an efficient means for large scale production of miniaturized oligonucleotide arrays 
for sequencing by hybridization (SBH), diagnostic testing for inherited or somatically acquired genetic diseases, and 
forensic analysis. Other applications include determination of sequence specificity of nucleic acids, protein-nucleic acid 
complexes and other polymer-polymer interactions. 

20 

SUMMARY OF THE INVENTION 

The present invention provides arrays of oligonucleotide analogues attached to solid substrates. Oligonucleotide 
analogues have different hybridization properties than oligonucleotides based upon naturally occurring nucleotides. By 
25 incorporating oligonucleotide analogues into the arrays of the invention, hybridization to a target nucleic acid is opti- 
mized. 

The oligonucleotide analogue arrays have virtually any number of different members, determined largely by the 
number or variety of compounds to be screened against the array in a given application. In one group of embodiments, 
the array has from 10 up to 100 oligonucleotide analogue members. In other groups of embodiments, the arrays have 

30 between 100 and 10,000 members, and in yet other embodiments the arrays have between 10,000 and 1,000,0000 
members. In preferred embodiments, the array will have a density of more than 100 members at known locations per 
cm 2 , or more preferably, more than 1000 members per cm 2 . In some embodiments, the arrays have a density of more 
than 10,000 members per cm 2 . 

The solid substrate upon which the array is constructed includes any material upon which oligonucleotide ana- 

35 logues are attached in a defined relationship to one another, such as beads, arrays, and slides. Especially preferred oli- 
gonucleotide analogues of the array are between about 5 and about 20 nucleotides, nucleotide analogues or a mixture 
thereof in length. 

In one group of embodiments, nucleoside analogues incorporated into the oligonucleotide analogues of the array 
will have the chemical formula: 

40 



45 




so wherein R 1 and R 2 are independently selected from the group consisting of hydrogen, methyl, hydroxy, alkoxy (e.g., 
methoxy, ethoxy, propoxy, allyloxy, and propargyloxy), alkylthio, halogen (Fluorine, Chlorine, and Bromine), cyano, and 
azido, and wherein Y is a heterocyclic moiety, e.g., a base selected from the group consisting of purines, purine ana- 
logues, pyrimidines, pyrimidine analogues, universal bases (e.g., 5-nitroindole) or other groups or ring systems capable 
of forming one or more hydrogen bonds with corresponding moieties on alternate strands within a double- or triple- 

55 stranded nucleic acid or nucleic acid analogue, or other groups or ring systems capable of forming nearest-neighbor 
base-stacking interactions within a double- or triple-stranded complex. In other embodiments, the oligonucleotide ana- 
logues are not constructed from nucleosides, but are capable of binding to nucleic acids in solution due to structural 
similarities between the oligonucleotide analogue and a naturally occurring nucleic acid. An example of such an oligo- 
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nucleotide analogue is a peptide nucleic acid or polyamide nucleic acid in which bases which hydrogen bond to a 
nucleic acid are attached to a polyamide backbone. 

The present invention also provides target nucleic acids hybridized to oligonucleotide arrays. In the target nucleic 
acids of the invention, nucleotide analogues are incorporated into the target nucleic acid, altering the hybridization prop- 

5 erties of the target nucleic acid to an array of oligonucleotide probes. Typically, the oligonucleotide probe arrays also 
comprise nucleotide analogues. 

The target nucleic acids are typically synthesized by providing a nucleotide analogue as a reagent during the enzy- 
matic copying of a nucleic acid. For instance, nucleotide analogues are incorporated into polynucleic acid analogues 
using taq polymerase in a PCR reaction. Thus, a nucleic acid containing a sequence to be analyzed is typically ampli- 

10 f ied in a PCR or RNA amplification procedure with nucleotide analogues, and the resulting target nucleic acid analogue 
amplicon is hybridized to a nucleic acid analogue array. 

Oligonucleotide analogue arrays and target nucleic acids are optionally composed of oligonucleotide analogues 
which are resistant to hydrolysis or degradation by nuclease enzymes such as RNAase A. This has the advantage of 
providing the array or target nucleic acid with greater longevity by rendering it resistant to enzymatic degradation. For 

is example, analogues comprising 2 '-O-methyloligoribo nucleotides are resistant to RNAase A. 

Oligonucleotide analogue arrays are optionally arranged into libraries for screening compounds for desired charac- 
teristics, such as the ability to bind a specified oligonucleotide analogue, or oligonucleotide analogue-containing struc- 
ture. The libraries also include oligonucleotide analogue members which form conformationally-restricted probes, such 
as unimolecular double-stranded probes or unimolecular double-stranded probes which present a third chemical struc- 

20 ture of interest. For instance, the array of oligonucleotide analogues optionally include a plurality of different members, 
each member having the formula: Y— L 1 — X 1 — L 2 — X 2 , wherein Y is a solid substrate, X 1 and X 2 are complementary 
oligonucleotides containing at least one nucleotide analogue, L 1 is a spacer, and L 2 is a linking group having sufficient 
length such that X 1 and X 2 form a double-stranded oligonucleotide. An array of such members comprise a library of uni- 
molecular double-stranded oligonucleotide analogues. In another embodiment, the members of the array of oligonucle- 

25 otide are arranged to present a moiety of interest within the oligonucleotide analogue probes of the array. For instance, 
the arrays are optionally conformationally restricted, having the formula -X 11 — Z— X 12 , wherein X 11 and X 12 are com- 
plementary oligonucleotides or oligonucleotide analogues and Z is a chemical structure comprising the binding site of 
interest. 

Oligonucleotide analogue arrays are synthesized on a solid substrate by a variety of methods, including light- 
so directed chemical coupling, and selectively flowing synthetic reagents over portions of the solid substrate. The solid 
substrate is prepared for synthesis or attachment of oligonucleotides by treatment with suitable reagents. For example, 
glass is prepared by treatment with silane reagents. 

The present invention provides methods for determining whether a molecule of interest binds members of the oli- 
gonucleotide analogue array. For instance, in one embodiment, a target molecule is hybridized to the array and the 
35 resulting hybridization pattern is determined. The target molecule includes genomic DNA, cDNA, unspliced RNA, 
mRNA, and rRNA, nucleic acid analogues, proteins and chemical polymers. The target molecules are optionally ampli- 
fied prior to being hybridized to the array, e.g., by PCR, LCR, or cloning methods. 

The oligonucleotide analogue members of the array used in the above methods are synthesized by any described 
method for creating arrays. In one embodiment, the oligonucleotide analogue members are attached to the solid sub- 
40 strate. or synthesized on the solid substrate by light-directed very large scale immobilized polymer synthesis, e.g., 
using photo-removable protecting groups during synthesis. In another embodiment, the oligonucleotide members are 
attached to the solid substrate by forming a plurality of channels adjacent to the surface of said substrate, placing 
selected monomers in said channels to synthesize oligonucleotide analogues at predetermined portions of selected 
regions, wherein the portion of the selected regions comprise oligonucleotide analogues different from oligonucleotide 
45 analogues in at least one other of the selected regions, and repeating the steps with the channels formed along a sec- 
ond portion of the selected regions. The solid substrate is any suitable material as described above, including beads, 
slides, and arrays, each of which is constructed from, e.g., silica, polymers and glass. 

DEFINITIONS 

50 

An "Oligonucleotide" is a nucleic add sequence composed of two or more nucleotides. An oligonucleotide is option- 
ally derived from natural sources, but is often synthesized chemically. It is of any size. An "oligonucleotide analogue" 
refers to a polymer with two or more monomeric subunits, wherein the subunits have some structural features in com- 
mon with a naturally occurring oligonucleotide which allow it to hybridize with a naturally occurring oligonucleotide in 
55 solution. For instance, structural groups are optionally added to the ribose or base of a nucleoside for incorporation into 
an oligonucleotide, such as a methyl or allyl group at the 2'-0 position on the ribose, or a f luoro group which substitutes 
for the 2'-0 group, or a bromo group on the ribonucleoside base. The phosphodiester linkage, or "sugar-phosphate 
backbone" of the oligonucleotide analogue is substituted or modified, for instance with methyl phosphonates or O- 
methyi phosphates. Another example of an oligonucleotide analogue for purposes of this disclosure includes "peptide 
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nucleic acids" in which native or modified nucleic acid bases are attached to a polyamide backbone. Oligonucleotide 
analogues optionally comprise a mixture of naturally occurring nucleotides and nucleotide analogues. However, an oli- 
gonucleotide which is made entirely of naturally occurring nucleotides [i.e., those comprising DNA or RNA), with the 
exception of a protecting group on the end of the oligonucleotide, such as a protecting group used during standard 

5 nucleic acid synthesis is not considered an oligonucleotide analogue for purposes of this invention. 

A "nucleoside" is a pentose glycoside in which the aglycone is a heterocyclic base; upon the addition of a phos- 
phate group the compound becomes a nucleotide. The major biological nucleosides are p-glycoside derivatives of D- 
ribose or D-2-deoxyribose. Nucleotides are phosphate esters of nucleosides which are generally acidic in solution due 
to the hydroxy groups on the phosphate. The nucleosides of DNA and RNA are connected together via phosphate units 

10 attached to the 3' position of one pentose and the 5' position of the next pentose. Nucleotide analogues and/or nucleo- 
side analogues are molecules with structural similarities to the naturally occurring nucleotides or nucleosides as dis- 
cussed above in the context of oligonucleotide analogues. 

A "nucleic acid reagent" utilized in standard automated oligonucleotide synthesis typically caries a protected phos- 
phate on the 3* hydroxyl of the ribose. Thus, nucleic acid reagents are referred to as nucleotides, nucleotide reagents, 

is nucleoside reagents, nucleoside phosphates, nucleoside-3'-phosphates, nucleoside phosphoramidites, phosphora- 
midites, nucleoside phosphonates, phosphonates and the like. It is generally understood that nucleotide reagents carry 
a reactive, or activatible, phosphoryl or phosphonyl moiety in order to form a phosphodiester linkage. 

A "protecting group" as used herein, refers to any of the groups which are designed to block one reactive site in a 
molecule while a chemical reaction is carried out at another reactive site. More particularly, the protecting groups used 

20 herein are optionally any of those groups described in Greene, et a/., Protective Groups In Organic Chemistry, 2nd Ed., 
John Wiley & Sons, New York, NY, 1991 , which is incorporated herein by reference. The proper selection of protecting 
groups for a particular synthesis is governed by the overall methods employed in the synthesis. For example, in "light- 
directed" synthesis, discussed herein, the protecting groups are photolabile protecting groups such as NVOC, MeNPoc, 
and those disclosed in co-pending Application PCT/US93/1 0162 (filed October 22, 1993), incorporated herein by refer- 

25 ence. In other methods, protecting groups are removed by chemical methods and include groups such as FMOC, DMT 
and others known to those of skill in the art. 

A "purine" is a generic term based upon the specific compound "purine" having a skeletal structure derived from 
the fusion of a pyrimidine ring and an imidazole ring. It is generally, and herein, used to describe a generic class of com- 
pounds which have an atom or a group of atoms added to the parent purine compound, such as the bases found in the 

30 naturally occurring nucleic acids adenine (6-aminopurine) and guanine (2-amino-6-oxopurine), or less commonly 
occurring molecules such as 2-amino-adenine, N 6 -methyladenine, or 2-methylguanine. 

A "purine analogue" has a heterocyclic ring with structural similarities to a purine, in which an atom or group of 
atoms is substituted for an atom in the purine ring. For instance, in one embodiment, one or more N atoms of the purine 
heterocyclic ring are replaced by C atoms. 

35 A "pyrimidine" is a compound with a specific heterocyclic diazine ring structure, but is used generically by persons 
of skill and herein to refer to any compound having a 1 ,3-diazine ring with minor additions, such as the common nucleic 
acid bases cytosine, thymine, uracil, 5-methylcytosine and 5-hydroxymethylcytosine, or the non-naturally occurring 5- 
bromo-uracil. 

A "pyrimidine analogue" is a compound with structural similarity to a pyrimidine, in which one or more atom in the 
40 pyrimidine ring is substituted. For instance, in one embodiment, one or more of the N atoms of the ring are substituted 
with C atoms. 

A "solid substrate" has fixed organizational support matrix, such as silica, polymeric materials, or glass. In some 
embodiments, at least one surface of the substrate is partially planar. In other embodiments it is desirable to physically 
separate regions of the substrate to delineate synthetic regions, for example with trenches, grooves, wells or the like. 
45 Example of solid substrates include slides, beads and arrays. 

DESCRIPTION OF THE DRAWING 

Figure 1 shows two panels (Figure 1 A and Figure 1B) illustrating the difference in fluorescence intensity between 
so matched and mismatched probes on an oligonucleotide analogue array. 

Figure 2 is a graphic illustration of specific light-directed chemical coupling of oligonucleotide analogue monomers 
to an array. 

Figure 3 shows the relative efficiency and specificity of hybridization for immobilized probe arrays containing ade- 
nine versus probe arrays containing 2,6-diaminopurine nucleotides. (3'-CATCGTAGAA-5' (SEQ ID NO:1)). 
55 Figure 4 shows the effect of substituting adenine with 2,6-diaminopurine (D) in immobilized poly-dA probe arrays. 
(AAAAANAAAAA (SEQ ID NO:2)). 

Figure 5 shows the effects of substituting 5-propynyl-2'-deoxyuridine and 2-amino-2' deoxyadenosine in AT arrays 
on hybridization to a target nucleic acid. (ATATAATATA (SEQ ID NO:3) and CGCGCCGCGC (SEQ ID NO:4)). 
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Figure 6 shows the effects of dl and 7-deaza-dG substitutions in oligonucleotide arrays. (3'- 
ATGTT(GlG2G3G4G5)CGGGT-5' (SEQ ID NO:5)). 

DETAILED DESCRIPTION 

5 

Methods of synthesizing desired single stranded oligonucleotide and oligonucleotide analogue sequences are 
known to those of skill in the art. In particular, methods of synthesizing oligonucleotides and oligonucleotide analogues 
are found in, for example, Oligonucleotide Synthesis: A Practical Approach, Gait, ed., IRL Press, Oxford (1984); W.H.A. 
Kuijpers Nucleic Acids Research 18(17), 5197 (1994); K.L Dueholm J. Org. Chem. 59, 5767-5773 (1994), and S. 

10 Agrawal (ed.) Methods in Molecular Biology, volume 20, each of which is incorporated herein by reference in its entirety 
for all purposes. Synthesizing unimolecular double-stranded DNA in solution has also been described. See, copending 
application SN 08/327,687, which is incorporated herein for all purposes. 

Improved methods of forming large arrays of oligonucleotides, peptides and other polymer sequences with a mini- 
mal number of synthetic steps are known. See, Pirrung et aL, U.S. Patent No. 5,143,854 (see also, PCT Application 

is No. WO 90/15070) and Fodor et a/., PCT Publication No. WO 92/10092, which are incorporated herein by reference, 
which disclose methods of forming vast arrays of peptides, oligonucleotides and other molecules using, for example, 
light-directed synthesis techniques. See also, Fodor et aL, (1991) Science, 251 , 767-77 which is incorporated herein 
by reference for all purposes. These procedures for synthesis of polymer arrays are now referred to as VLSIPS™ pro- 
cedures. 

20 Using the VLSIPS™ approach, one heterogenous array of polymers is converted, through simultaneous coupling 
at a number of reaction sites, into a different heterogenous array. See, U.S. Application Serial Nos. 07/796,243 and 
07/980,523, the disclosures of which are incorporated herein for all purposes. 

The development of VLSIPS™ technology as described in the above-noted U.S. Patent No. 5,143,854 and PCT 
patent publication Nos. WO 90/15070 and 92/10092 is considered pioneering technology in the fields of combinatorial 

25 synthesis and screening of combinatorial libraries. More recently, patent application Serial No. 08/082,937, filed June 
25, 1993 (incorporated herein by reference), describes methods for making arrays of oligonucleotide probes that are 
used to check or determine a partial or complete sequence of a target nucleic acid and to detect the presence of a 
nucleic acid containing a specific oligonucleotide sequence. 

30 Combinatorial Synthesis of Oligonucleotide Arrays 

VLSIPS™ technology provides for the combinatorial synthesis of oligonucleotide arrays. The combinatorial 
VLSIPS™ strategy allows for the synthesis of arrays containing a large number of related probes using a minimal 
number of synthetic steps. For instance, it is possible to synthesize and attach all possible DNA 8mer oligonucleotides 

35 (4 8 , or 65,536 possible combinations) using only 32 chemical synthetic steps. In general, VLSIPS™ procedures provide 
a method of producing 4 n different oligonucleotide probes on an array using only 4n synthetic steps. 

In brief, the light-directed combinatorial synthesis of oligonucleotide arrays on a glass surface proceeds using auto- 
mated phosphoramidite chemistry and chip masking techniques. In one specific implementation, a glass surface is deri- 
vatized with a silane reagent containing a functional group, e.g., a hydroxy! or amine group blocked by a photolabile 

40 protecting group. Photolysis through a photolithogaphic mask is used selectively to expose functional groups which are 
then ready to react with incoming 5'-photoprotected nucleoside phosphoramtdites. See, Figure 2. The phosphora- 
midites react only with those sites which are illuminated (and thus exposed by removal of the photolabile blocking 
group). Thus, the phosphoramidites only add to those areas selectively exposed from the preceding step. These steps 
are repeated until the desired array of sequences have been synthesized on the solid surface. Combinatorial synthesis 

45 of different oligonucleotide analogues at different locations on the array is determined by the pattern of illumination dur- 
ing synthesis and the order of addition of coupling reagents. 

In the event that an oligonucleotide analogue with a polyamide backbone is used in the VLSIPS™ procedure, it is 
generally inappropriate to use phosphoramidite chemistry to perform the synthetic steps, since the monomers do not 
attach to one another via a phosphate linkage. Instead, peptide synthetic method are substituted. See, e.g., Pirrung et 

so al. U.S. Pat. No. 5,143,854. 

Peptide nucleic acids are commercially available from, e.g., Biosearch, Inc. (Bedford, MA) which comprise a polya- 
mide backbone and the bases found in naturally occurring nucleosides. Peptide nucleic acids are capable of binding to 
nucleic acids with high specificity, and are considered "oligonucleotide analogues" for purposes of this disclosure. Note 
that peptide nucleic acids optionally comprise bases other than those which are naturally occurring. 

55 

Hybridization of Nucleotide Analogues 

The stability of duplexes formed between RNAs or DNAs are generally in the order of RNA:RNA > RNA:DNA > 
DNA:DNA, in solution. Long probes have better duplex stability with a target, but poorer mismatch discrimination than 



5 



EP 0 742 287 A2 



shorter probes (mismatch discrimination refers to the measured hybridization signal ratio between a perfect match 
probe and a single base mismatch probe. Shorter probes (e.g., 8-mers) discriminate mismatches very well, but the 
overall duplex stability is low. In order to optimize mismatch discrimination and duplex stability, the present invention 
provides a variety of nucleotide analogues incorporated into polymers and attached in an array to a solid substrate. 

s Altering the thermal stability (T m ) of the duplex formed between the target and the probe using, e.g., known oligo- 
nucleotide analogues allows for optimization of duplex stability and mismatch discrimination. One useful aspect of alter- 
ing the T m arises from the fact that Adenine-Thymine (A-T) duplexes have a lower T m than Guanine-Cytosine (G-C) 
duplexes, due in part to the fact that the A-T duplexes have 2 hydrogen bonds per base-pair, while the G-C duplexes 
have 3 hydrogen bonds per base pair. In heterogeneous oligonucleotide arrays in which there is a non-uniform distribu- 

10 tion of bases, it can be difficult to optimize hybridization conditions for all probes simultaneously. Thus, in some embod- 
iments, it is desirable to destabilize G-C-rich duplexes and/or to increase the stability of A-T-rich duplexes while 
maintaining the sequence specificity of hybridization. This is accomplished, e.g., by replacing one or more of the native 
nucleotides in the probe (or the target) with certain modified, non-standard nucleotides. Substitution of guanine resi- 
dues with 7-deazaguanine, for example, will generally destabilize duplexes, whereas substituting adenine residues with 

is 2,6-diaminopurine will enhance duplex stability. A variety of other modified bases are also incorporated into nucleic 
acids to enhance or decrease overall duplex stability while maintaining specificity of hybridization. The incorporation of 
6-aza-pyrimidine analogs into oligonucleotide probes generally decreases their binding affinity for complementary 
nucleic acids. Many 5-substituted pyrimidines substantially increase the stability of hybrids in which they have been 
substituted in place of the native pyrimidines in the sequence. Examples include 5-bromo-, 5-methyl-, 5-propynyl-, 5- 

20 (imidazol-2-yl)-and 5-(thiazol-2-yl)- derivatives of cytosine and uracil. 

Many modified nucleosides, nucleotides and various bases suitable for incorporation into nucleosides are commer- 
cially available from a variety of manufacturers, including the SIGMA chemical company (Saint Louis, MO), R&D sys- 
tems (Minneapolis, MN), Pharmacia LKB Biotechnology (Piscataway, NJ), CLONTECH Laboratories, Inc. (Palo Alto, 
CA), Chem Genes Corp., Aldrich Chemical Company (Milwaukee, Wl), Glen Research, Inc., GIBCO BRL Life Technol- 

25 ogies, Inc. (Gaithersberg, MD), Fluka Chemica-Biochemika Analytika (Fluka Chemie AG, Buchs, Switzerland), Invitro- 
gen, San Diego, CA, and Applied Biosystems (Foster City, CA), as well as many other commercial sources known to 
one of skill. Methods of attaching bases to sugar moieties to form nucleosides are known. See, e.g., Lukevics and Zab- 
locka (1991), Nucleoside Synthesis: Organosilicon Methods Ellis Horwood Limited Chichester, West Sussex, England 
and the references therein. Methods of phosphor ylating nucleosides to form nucleotides, and of incorporating nucie- 

30 otides into oligonucleotides are also known. See, e.g., Agrawal (ed) (1993) Protocols for Oligonucleotides and 
Analogues, Synthesis and Properties, Methods in Molecular Biology volume 20, Humana Press, Towota, N.J., and the 
references therein. See also, Crooke and Lebleu, and Sanghvi and Cook, and the references cited therein, both supra. 

Groups are also linked to various positions on the nucleoside sugar ring or on the purine or pyrimidine rings which 
may stabilize the duplex by electrostatic interactions with the negatively charged phosphate backbone, or through 

35 hydrogen bonding interactions in the major and minor groves. For example, adenosine and guanosine nucleotides are 
optionally substituted at the N 2 position with an imidazolyl propyl group, increasing duplex stability. Universal base ana- 
logues such as 3-nitropyrrole and 5-nitroindole are optionally included in oligonucleotide probes to improve duplex sta- 
bility through base stacking interactions. 

Selecting the length of oligonucleotide probes is also an important consideration when optimizing hybridization 

40 specificity. In general, shorter probe sequences are more specific than longer ones, in that the occurrence of a single- 
base mismatch has a greater destabilizing effect on the hybrid duplex. However, as the overall thermodynamic stability 
of hybrids decreases with length, in some embodiments it is desirable to enhance duplex stability for short probes glo- 
bally. Certain modifications of the sugar moiety in oligonucleotides provide useful stabilization, and these can be used 
to increase the affinity of probes for complementary nucleic acid sequences. For example, 2'-0-methyl-, 2'-0-propyl-, 

45 and 2'-0-allyl-oligoribonucleotides have higher binding affinities for complementary RNA sequences than their unmodi- 
fied counterparts. Probes comprised of 2'-fluoro-2'-deoxyollgoribonucleotides also form more stable hybrids with RNA 
than do their unmodified counterparts. 

Replacement or substitution of the internucieotide phosphodiester linkage in oligo- or poly-nucleotides is also used 
to either increase or decrease the affinity of probe-target interactions. For example, substituting phosphodiester link- 
so ages with phosphorothioate or phosphorodithioate linkages generally lowers duplex stability, without affecting 
sequence specificity. Substitutions with a non-ionic methylphosphonate linkage (racemic, or preferably, Rp stereochem- 
istry) have a stabilizing influence on hybrid formation. Neutral or cationic phosphoramidate linkages also result in 
enhanced duplex stabilization. The phosphate diester backbone has been replaced with a variety of other stabilizing, 
non-natural linkages which have been studied as potential antisense therapeutic agents. See, e.g. , Crooke and Lebleu 

55 (eds) (1993) Antisense Research Applications CRC Press; and, Sanghvi and Cook (eds) (1994) Carbohydrate modifi- 
cations in Antisense Research ACS Symp. Ser. #580 ACS, Washington DC. Very stable hybrids are formed between 
nucleic acids and probes comprised of peptide nucleic acids, in which the entire sugar-phosphate backbone has been 
replaced with a poiyamide structure. 
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Another important factor which sometimes affects the use of oligonucleotide probe arrays is the nature of the target 
nucleic acid. Oligodeoxynucleotide probes can hybridize to DNA and RNA targets with different affinity and specificity. 
For example, probe sequences containing long "runs" of consecutive deoxyadenosine residues form less stable hybrids 
with complementary RNA sequences than with the complementary DNA sequences. Substitution of dA in the probe 

5 with either 2,6-diaminopurine deoxyriboside, or 2'-alkoxy- or 2'-fluoro-dA enhances hybridization with RNA targets. 

Internal structure within nucleic acid probes or the targets also influences hybridization efficiency. For example, 
GC-rich sequences, and sequences containing "runs" of consecutive G residues frequently self-associate to form 
higher-order structures, and this can inhibit their binding to complementary sequences. See, Zimmerman etal. (1975) 
J. Mol Biol 92: 181; Kim (1991) Nature 351: 331; Sen and Gilbert (1988) Nature 335: 364; and Sunquist and Klug 

10 (1989) Nature 342: 825. These structures are selectively destabilized by the substitution of one or more guanine resi- 
dues with one or more of the following purines or purine analogs: 7-deazaguanine, 8-aza-7-deazaguanine, 2-aminop- 
urine, 1 H-purine, and hypoxanthine, in order to enhance hybridization. 

Modified nucleic acids and nucleic acid analogs can also be used to improve the chemical stability of probe arrays. 
For example, certain processes and conditions that are useful for either the fabrication or subsequent use of the arrays, 

is may not be compatible with standard oligonucleotide chemistry, and alternate chemistry can be employed to overcome 
these problems. For example, exposure to acidic conditions will cause depurination of purine nucleotides, ultimately 
resulting in chain cleavage and overall degradation of the probe array. In this case, adenine and guanine are replaced 
with 7-deazaadenine and 7-deazaguanine, respectively, in order to stabilize the oligonucleotide probes towards acidic 
conditions which are used during the manufacture or use of the arrays. 

20 Base, phosphate and sugar modifications are used in combination to make highly modified oligonucleotide ana- 
logues which take advantage of the properties of each of the various modifications. For example, oligonucleotides 
which have higher binding affinities for complementary sequences than their unmodified counterparts (e.g., 2'-0- 
methyl-, 2'-0-propyl-, and 2'-0-allyt oligonucleotides) can be incorporated into oligonucleotides with modified bases 
(deazaguanine, 8-aza-7-deazaguanine, 2-aminopurine, 1 H-purine, hypoxanthine and the like) with non-ionic methyl- 

25 phosphonate linkages or neutral or cationic phosphoramidate linkages, resulting in additive stabilization of duplex for- 
mation between the oligonucleotide and a target nucleic acid. For instance, one preferred oligonucleotide comprises a 
2'-0-methyl-2,6-diaminopurineriboside phosphorothioate. Similarly, any of the modified bases described herein can be 
incorporated into peptide nucleic acids, in which the entire sugar-phosphate backbone has been replaced with a polya- 
mide structure. 

30 Thermal equilibrium studies, kinetic "on-rate" studies, and sequence specificity analysis is optionally performed for 
any target oligonucleotide and probe or probe analogue. The data obtained shows the behavior of the analogues upon 
duplex formation with target oligonucleotides. Altered duplex stability conferred by using oligonucleotide analogue 
probes are ascertained by following, e.g., fluorescence signal intensity of oligonucleotide analogue arrays hybridized 
with a target oligonucleotide over time. The data allow optimization of specific hybridization conditions at, e.g., room 

35 temperature (for simplified diagnostic applications). 

Another way of verifying altered duplex stability is by following the signal intensity generated upon hybridization with 
time. Previous experiments using DNA targets and DNA chips have shown that signal intensity increases with time, and 
that the more stable duplexes generate higher signal intensities faster than less stable duplexes. The signals reach a 
plateau or "saturate" after a certain amount of time due to all of the binding sites becoming occupied. These data allow 

40 for optimization of hybridization, and determination of equilibration conditions at a specified temperature. 

Graphs of signal intensity and base mismatch positions are plotted and the ratios of perfect match versus mis- 
matches calculated. This calculation shows the sequence specific properties of nucleotide analogues as probes. Per- 
fect match/mismatch ratios greater than 4 are often desirable in an oligonucleotide diagnostic assay because, for a 
diploid genome, ratios of 2 have to be distinguished (e.g.. in the case of a heterozygous trait or sequence). 

45 

Target Nucleic Acids Which Comprise Nucleotide Analogues 

Modified nucleotides and nucleotide analogues are incorporated synthetically or enzymatically into DNA or RNA 
target nucleic acids for hybridization analysis to oligonucleotide arrays. The incorporation of nucleotide analogues in the 

so target optimizes the hybridization of the target in terms of sequence specificity and/or the overall affinity of binding to 
oligonucleotide and oligonucleotide analogue probe arrays. The use of nucleotide analogues in either the oligonucle- 
otide array or the target nucleic acid, or both, improves optimizability of hybridization interactions. Examples of useful 
nucleotide analogues which are substituted for naturally occurring nucleotides include 7-deazaguanosine, 2,6<iiami- 
nopurine nucleotides, 5-propynyt and other 5-substituted pyrimidine nucleotides, 2'-fluro and 2'-methoxy -2'<ieoxynu- 

55 cleotides and the like. 

These nucleotide analogues are incorporated into nucleic acids using the synthetic methods described supra, or 
using DNA or RNA polymerases. The nucleotide analogues are preferably incorporated into target nucleic acids using 
in vitro amplification methods such as PCR, LCR, Qp-replicase expansion, in vitro transcription (e.g., nick translation 
or random-primer transcription) and the like. Alternatively, the nucleotide analogues are optionally incorporated into 
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cloned nucleic acids by culturing a cell which comprises the cloned nucleic acid in media which includes a nucleotide 
analogue. 

Similar to the use of nucleotide analogues in probe arrays, 7-deazaguanosine is used in target nucleic acids to sub- 
stitute for G/dG to enhance target hybridization by reducing secondary structure in sequences containing runs of poly- 
s G/dG. 6-diaminopurine nucleotides substitute for A/dA to enhance target hybridization through enhanced H-bonding to 
T or U rich probes. 5-propynyl and other 5-substituted pyrimidine nucleotides substitute for natural pyrimidines to 
enhance target hybridization to certain purine rich probes. 2'-fluro and 2'-methoxy -2'-deoxynucleotides substitute for 
natural nucleotides to enhance target hybridization to similarly substituted probe sequences. 

10 Synthesis of 5'-photoprotected 2'-0 alkyl ribonucleotide analogues 

The light-directed synthesis of complex arrays of nucleotide analogues on a glass surface is achieved by derivatiz- 
ing cyanoethyl phosphoramidite nucleotides and nucleotide analogues (e.g., nucleoside analogues of uridine, thymi- 
dine, cytidine, adenosine and guanosine, with phosphates) with, for example, the photolabile MeNPoc group in the 5'- 

15 hydroxyl position instead of the usual dimethoxytrityl group. See, application SN PCTAJS94/1 2305. 

Specific base-protected 2'-0 alkyl nucleosides are commercially available, from, e.g., Chem Genes Corp. (MA). 
The photolabile MeNPoc group is added to the 5'-hydroxyl position followed by phosphitylation to yield cyanoethyl phos- 
phoramidite monomers. Commercially available nucleosides are optionally modified (e.g., by 2-O-alkylation) to create 
nucleoside analogues which are used to generate oligonucleotide analogues. 

20 Modifications to the above procedures are used in some embodiments to avoid significant addition of MenPoc to 
the 3*-hydroxyl position. For instance, in one embodiment, a 2'-0-methyl ribonucleotide analogue is reacted with DMT- 
Cl {di(p-methoxyphenyl)phenylchloride} in the presence of pyridine to generate a 2'-0-methyl-5'-0-DMT ribonucleotide 
analogue. This allows for the addition of TBDMS to the 3*-0 of the ribonucleoside analogue by reaction with TBDMS- 
Trrflate (t-butyldimethylsilyltrifluoromethane-sulfonate) in the presence of triethylamine in THF (tetrahydrofuran) to yield 

25 a 2'-0-methyl-3 , -0-TBDMS-5'-0-DMT ribonucleotide base analogue. This analogue is treated with TCAA (trichloroace- 
tic acid) to cleave off the DMT group, leaving a reactive hydroxyl group at the 5' position. MeNPoc is then added to the 
oxygen of the 5' hydroxyl group using MenPoc-CI in the presence of pyridine. The TBDMS group is then cleaved with 
F" (e.g., NaF) to yield a ribonucleotide base analogue with a MeNPoc group attached to the 5' oxygen on the nucleotide 
analogue. If appropriate, this analogue is phosphitylated to yield a phosphoramidite for oligonucleotide analogue syn- 

30 thesis. Other nucleosides or nucleoside analogues are protected by similar procedures. 

Synthesis of Oligonucleotide Analogue Arrays on Chips 

Other than the use of photoremovable protecting groups, the nucleoside coupling chemistry used in VLSIPS™ 

35 technology for synthesizing oligonucleotides and oligonucleotide analogues on chips is similar to that used for oligonu- 
cleotide synthesis. The oligonucleotide is typically linked to the substrate via the 3'-hydroxyl group of the oligonucleotide 
and a functional group on the substrate which results in the formation of an ether, ester, carbamate or phosphate ester 
linkage. Nucleotide or oligonucleotide analogues are attached to the solid support via carbon-carbon bonds using, for 
example, supports having (poly)trrf luorochloroethylene surfaces, or preferably, by siloxane bonds (using, for example, 

40 glass or silicon oxide as the solid support). Siloxane bonds with the surface of the support are formed in one embodi- 
ment via reactions of surface attaching portions bearing trichlorosilyl or triaikoxysilyl groups. The surface attaching 
groups have a site for attachment of the oligonucleotide analogue portion. For example, groups which are suitable for 
attachment include amines, hydroxy!, thiol, and carboxyl. Preferred surface attaching or derivrtizing portions include 
aminoalkylsilanes and hydroxyalkylsilanes. In particularly preferred embodiments, the surface attaching portion of the 

45 oligonucleotide analogue is either bis(2-hydroxyethyl)-aminopropyltriethoxysilane, n-(3-triethoxysilylpropyl)-4-hydroxy- 
butylamide, aminopropyltriethoxysilane or hydroxypropyltriethoxysilane. 

The oligoribonucleotides generated by synthesis using ordinary ribonucleotides are usually base labile due to the 
presence of the 2*-hydroxyl group. 2'-0-methyloligoribonucleotides (2'-OMeORNs), analogues of RNA where the 2- 
hydroxyl group is methylated, are DNAse and RNAse resistant, making them less base labile. Sproat, B.S., and Lam- 

so ond, A. I. in Oligonucleotides and Analogues: A Practical Approach, edited by F. Eckstein, New York: IRL Press at 
Oxford University Press, 1991 , pp. 49-86, incorporated herein by reference for all purposes, have reported the synthe- 
sis of mixed sequences of 2'-0-Methoxy-oligoribonucleotides (2'-0-MeORNs) using dimethoxytrityl phosphoramidite 
chemistry. These 2'-0-MeORNs display greater binding affinity for complementary nucleic acids than their unmodified 
counterparts. 

55 Other embodiments of the invention provide mechanical means to generate oligonucleotide analogues. These 
techniques are discussed in co-pending application SN 07/796,243, filed 1 1/22/91 , which is incorporated herein by ref- 
erence in its entirety for all purposes. Essentially, oligonucleotide analogue reagents are directed over the surface of a 
substrate such that a predefined array of oligonucleotide analogues is created. For instance, a series of channels, 
grooves, or spots are formed on or adjacent to a substrate. Reagents are selectively flowed through or deposited in the 
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channels, grooves, or spots, forming an array having different oligonucleotides and/or oligonucleotide analogues at 
selected locations on the substrate. 

Detection of Hybridization 

s 

In one embodiment, hybridization is detected by labeling a target with, e.g., fluorescein or other known visualization 
agents and incubating the target with an array of oligonucleotide analogue probes. Upon duplex formation by the target 
with a probe in the array (or triplex formation in embodiments where the array comprises unimolecular double-stranded 
probes), the fluorescein label is excited by, e.g., an argon laser and detected by viewing the array, e.g., through a scan- 
10 ning confocal microscope. 

Sequencing by hybridization 

Current sequencing methodologies are highly reliant on complex procedures and require substantial manual effort. 
75 Conventional DNA sequencing technology is a laborious procedure requiring electrophoretic size separation of labeled 
DNA fragments. An alternative approach involves a hybridization strategy carried out by attaching target DNA to a sur- 
face. The target is interrogated with a set of oligonucleotide probes, one at a time (see, application SN 
PCT/US94/12305). 

A preferred method of oligonucleotide probe array synthesis involves the use of light to direct the synthesis of oli- 
20 gonucleotide analogue probes in high-density, miniaturized arrays. Matrices of spatially-defined oligonucleotide ana- 
logue probe arrays were generated. The ability to use these arrays to identify complementary sequences was 
demonstrated by hybridizing fluorescent labeled oligonucleotides to the matrices produced. 

Oligonucleotide analogue arrays are used, e.g., to study sequence specific hybridization of nucleic acids, or pro- 
tein-nucleic acid interactions. Oligonucleotide analogue arrays are used to define the thermodynamic and kinetic rules 
25 governing the formation and stability of oligonucleotide and oligonucleotide analogue complexes. 

Oligonucleotide analogue Probe Arrays and Libraries 

The use of oligonucleotide analogues in probe arrays provides several benefits as compared to standard oligonu- 
30 cleotide arrays. For instance, as discussed supra, certain oligonucleotide analogues have enhanced hybridization char- 
acteristics to complementary nucleic acids as compared with oligonucleotides made of naturally occurring nucleotides. 
One primary benefit of enhanced hybridization characteristics is that oligonucleotide analogue probes are optionally 
shorter than corresponding probes which do not include nucleotide analogues. 

Standard oligonucleotide probe arrays typically require fairly long probes (about 15-25 nucleotides) to achieve 
35 strong binding to target nucleic acids. The use of such long probes is disadvantageous for two reasons. First, the longer 
the probe, the more synthetic steps must be performed to make the probe and any probe array comprising the probe. 
This increases the cost of making the probes and arrays. Furthermore, as each synthetic step results in less than 1 00% 
coupling for every nucleotide, the quality of the probes degrades as they become longer. Secondly, short probes provide 
better mis-match discrimination for hybridization to a target nucleic acid. This is because a single base mismatch for a 
40 short probe-target hybridization is less destabilizing than a single mismatch for a long probe-target hybridization. Thus, 
it is harder to distinguish a single probe-target mismatch when the probe is a 20-mer than when the probe is an 8-mer. 
Accordingly, the use of short oligonucleotide analogue probes reduces costs and increases mismatch discrimination in 
probe arrays. 

The enhanced hybridization characteristics of oligonucleotide analogues also allows for the creation of oligonucle- 
45 otide analogue probe arrays where the probes in the arrays have substantial secondary structure. For instance, the oli- 
gonucleotide analogue probes are optionally configured to be fully or partially double stranded on the array. The probes 
are optionally complexed with complementary nucleic acids, or are optionally unimolecular oligonucleotides with self- 
complementary regions. Libraries of diverse double-stranded oligonucleotide analogue probes are used, for example, 
in screening studies to determine binding affinity of nucleic acid binding proteins, drugs, or oligonucleotides (e.g., to 
so examine triple helix formation). Specific oligonucleotide analogues are known to be conducive to the formation of unu- 
sual secondary structure. See, Durland (1995) Bioconjugate Chem. 6: 278-282. General strategies for using unimo- 
lecular double-stranded oligonucleotides as probes and for library generation is described in application SN 
08/327,687, and similar strategies are applicable to oligonucleotide analogue probes. 

In general, a solid support, which optionally has an attached spacer molecule is attached to the distal end of the 
55 oligonucleotide analogue probe. The probe is attached as a single unit, or synthesized on the support or spacer in a 
monomer by monomer approach using the VLSIPS™ or mechanical partitioning methods described supra. Where the 
oligonucleotide analogue arrays are fully double-stranded, oligonucleotides (or oligonucleotide analogues) complemen- 
tary to the probes on the array are hybridized to the array. 
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In some embodiments, molecules other than oligonucleotides, such as proteins, dyes, co-factors, linkers and the 
like are incorporated into the oligonucleotide analogue probe, or attached to the distal end of the oligomer, e.g., as a 
spacing molecule, or as a probe or probe target. Flexible linkers are optionally used to separate complementary por- 
tions of the oligonucleotide analogue. 

s The present invention also contemplates the preparation of libraries of oligonucleotide analogues having bulges or 
loops in addition to complementary regions. Specific RNA bulges are often recognized by proteins (e.g., TAR RNA is 
recognized by the TAT protein of HIV). Accordingly, libraries of oligonucleotide analogue bulges or loops are useful in a 
number of diagnostic applications. The bulge or loop can be present in the oligonucleotide analogue or linker portions. 
Unimolecular analogue probes can be configured in a variety of ways. In one embodiment, the unimolecular probes 

10 comprise linkers, for example, where the probe is arranged according to the formula Y — L 1 —X 1 — L 2 — X 2 , in which Y 
represents a solid support, X 1 and X 2 represent a pair of complementary oligonucleotides or oligonucleotide analogues, 
L 1 represents a bond or a spacer, and L 2 represents a linking group having sufficient length such that X 1 and X 2 form 
a double-stranded oligonucleotide. The general synthetic and conformational strategy used in generating the double- 
stranded unimolecular probes is similar to that described in co-pending application SN 08/327,687, except that any of 

is the elements of the probe (L 1 , X 1 , L 2 and X 2 ) comprises a nucleotide or an oligonucleotide analogue. For instance, in 
one embodiment X 1 is an oligonucleotide analogue. 

The oligonucleotide analogue probes are optionally arranged to present a variety of moieties. For example, struc- 
tural components are optionally presented from the middle of a conformational^ restricted oligonucleotide analogue 
probe. In these embodiments, the analogue probes generally have the structure — X 11 — Z— X 12 wherein X 11 and X 12 

20 are complementary oligonucleotide analogues and Z is a structural element presented away from the surface of the 
probe array. Z can include an agonist or antagonist for a cell membrane receptor, a toxin, venom, viral epitope, hor- 
mone, peptide, enzyme, cofactor, drug, protein, antibody or the like. 

General tiling strategies for detection of a Polymorphism in a target oligonucleotide 

25 

In diagnostic applications, oligonucleotide analogue arrays (e.g., arrays on chips, slides or beads) are used to 
determine whether there are any differences between a reference sequence and a target oligonucleotide, e.g., whether 
an individual has a mutation or polymorphism in a known gene. As discussed supra, the oligonucleotide target is option- 
ally a nucleic acid such as a PCR amplicon which comprises one or more nucleotide analogues. In one embodiment, 

so arrays are designed to contain probes exhibiting complementarity to one or more selected reference sequence whose 
sequence is known. The arrays are used to read a target sequence comprising either the reference sequence itself or 
variants of that sequence. Any polynucleotide of known sequence is selected as a reference sequence. Reference 
sequences of interest include sequences known to include mutations or polymorphisms associated with phenotypic 
changes having clinical significance in human patients. For example, the CFTR gene and P53 gene in humans have 

35 been identified as the location of several mutations resulting in cystic fibrosis or cancer respectively. Other reference 
sequences of interest include those that serve to identify pathogenic microorganisms and/or are the site of mutations 
by which such microorganisms acquire drug resistance (e.g., the HIV reverse transcriptase gene for HIV resistance). 
Other reference sequences of interest include regions where polymorphic variations are known to occur (e.g., the D- 
loop region of mitochondrial DNA). These reference sequences also have utility for, e.g., forensic, cladistic, or epidemi- 

40 ological studies. 

Other reference sequences of interest include those from the genome of pathogenic viruses (e.g., hepatitis (A, B, 
or C) t herpes virus (e.g., VZV, HSV-1, HAV-6, HSV-II, CMV, and Epstein Barr virus), adenovirus, influenza virus, flaviv- 
iruses, echovirus, rhinovirus, coxsackie virus, cornovirus, respiratory syncytial virus, mumps virus, rotavirus, measles 
virus, rubella virus, parvovirus, vaccinia virus, HTLV virus, dengue virus, papillomavirus, moliuscum virus, poliovirus, 

45 rabies virus, JC virus and arboviral encephalitis virus. Other reference sequences of interest are from genomes or epi- 
somes of pathogenic bacteria, particularly regions that confer drug resistance or allow phylogenic characterization of 
the host (e.g., 16S rRNA or corresponding DNA). For example, such bacteria include chlamydia, rickettsial bacteria, 
mycobacteria, staphylococci, treptocci, pneumonococci, meningococci and conococci, klebsiella, proteus, serratia, 
pseudomonas, legioneila, diphtheria, salmonella, bacilli, cholera, tetanus, botulism, anthrax, plague, leptospirosis, and 

so Lymes disease bacteria. Other reference sequences of interest include those in which mutations result in the following 
autosomal recessive disorders: sickle cell anemia, p-thalassemia, phenylketonuria, galactosemia, Wilson's disease, 
hemochromatosis, severe combined immunodeficiency, alpha-1 -antitrypsin deficiency, albinism, alkaptonuria, lyso- 
somal storage diseases and Ehlers-Danlos syndrome. Other reference sequences of interest include those in which 
mutations result in X-linked recessive disorders: hemophilia, glucose-6-phosphate dehydrogenase, agammaglobulime- 

55 nia, diabetes insipidus, Lesch-Nyhan syndrome, muscular dystrophy, Wiskott-Aldrich syndrome, Fabry's disease and 
fragile X-syndrome. Other reference sequences of interest includes those in which mutations result in the following 
autosomal dominant disorders: familial hypercholesterolemia, polycystic kidney disease, Huntington's disease, heredi- 
tary spherocytosis, Marian's syndrome, von Willebrand's disease, neurofibromatosis, tuberous sclerosis, hereditary 
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hemorrhagic telangiectasia, familial colonic polyposis, Ehlers-Danlos syndrome, myotonic dystrophy, muscular dystro- 
phy, osteogenesis imperfecta, acute intermittent porphyria, and von Hippel-Undau disease. 

Although an array of oligonucleotide analogue probes is usually laid down in rows and columns for simplified data 
processing, such a physical arrangement of probes on the solid substrate is not essential. Provided that the spatial loca- 

5 tion of each probe in an array is known, the data from the probes is collected and processed to yield the sequence of a 
target irrespective of the physical arrangement of the probes on, e.g., a chip. In processing the data, the hybridization 
signals from the respective probes is assembled into any conceptual array desired for subsequent data reduction, what- 
ever the physical arrangement of probes on the substrate. 

In one embodiment, a basic tiling strategy provides an array of immobilized probes for analysis of a target oligonu- 

10 cleotide showing a high degree of sequence similarity to one or more selected reference oligonucleotide (e.g., detection 
of a point mutation in a target sequence). For instance, a first probe set comprises a plurality of probes exhibiting perfect 
complementarity with a selected reference oligonucleotide. The perfect complementarity usually exists throughout the 
length of the probe. However, probes having a segment or segments of perfect complementarity that is/are flanked by 
leading or trailing sequences lacking complementarity to the reference sequence can also be used. Within a segment 

15 of complementarity, each probe in the first probe set has at least one interrogation position that corresponds to a nucle- 
otide in the reference sequence. The interrogation position is aligned with the corresponding nucleotide in the reference 
sequence when the probe and reference sequence are aligned to maximize complementarity between the two. If a 
probe has more than one interrogation position, each corresponds with a respective nucleotide in the reference 
sequence. The identity of an interrogation position and corresponding nucleotide in a particular probe in the first probe 

20 set cannot be determined simply by inspection of the probe in the first set. An interrogation position and corresponding 
nucleotide is defined by the comparative structures of probes in the first probe set and corresponding probes from addi- 
tional probe sets. 

For each probe in the first set, there are, for purposes of the present illustration, multiple corresponding probes from 
additional probe sets. For instance, there are optionally probes corresponding to each nucleotide of interest in the ref- 
25 erence sequence. Each of the corresponding probes has an interrogation position aligned with that nucleotide of inter- 
est. Usually, the probes from the additional probe sets are identical to the corresponding probe from the first probe set 
with one exception. The exception is that at the interrogation position, which occurs in the same position in each of the 
corresponding probes from the additional probe sets. This position is occupied by a different nucleotide in the corre- 
sponding probe sets. Other tiling strategies are also employed, depending on the information to be obtained. 
30 The probes are oligonucleotide analogues which are capable of hybridizing with a target nucleic sequence by com- 
plementary base-pairing. Complementary base pairing includes sequence-specific base pairing, which comprises, 
e.g., Watson-Crick base pairing or other forms of base pairing such as Hoogsteen base pairing. The probes are 
attached by any appropriate linkage to a support. 3' attachment is more usual as this orientation is compatible with the 
preferred chemistry used in solid phase synthesis of oligonucleotides and oligonucleotide analogues (with the excep- 
ts tion of, e.g., analogues which do not have a phosphate backbone, such as peptide nucleic acids). 

EXAMPLES 

The following examples are provided by way of illustration only and not by way of limitation. A variety of parameters 
40 can be changed or modified to yield essentially similar results. 

One approach to enhancing oligonucleotide hybridization is to increase the thermal stability (T m ) of the duplex 
formed between the target and the probe using oligonucleotide analogues that are known to increase T m 's upon hybrid- 
ization to DNA. Enhanced hybridization using oligonucleotide analogues is described in the examples below, including 
enhanced hybridization in oligonucleotide arrays. 

45 

Example 1: solution oligonucleotide melting T m 

The T m of 2'O-methyl oligonucleotide analogues was compared to the T m for the corresponding DNA and RNA 
sequences in solution. In addition, the T m of 2'-0-methyl oligonucleotide:DNA, 2'-0-methy1 oligonucleotide:RNA and 

so RNA:DNA duplexes in solution was also determined. The T m was determined by varying the sample temperature and 
monitoring the absorbance of the sample solution at 260 nm. The oligonucleotide samples were dissolved in a 0.1 M 
NaCI solution with an oligonucleotide concentration of 2jiM. Table 1 summarizes the results of the experiment. The 
results show that the hybridization of DNA in solution has approximately the same T m as the hybridization of DNA with 
a 2'-0-methyl-substituted oligonucleotide analogue. The results also show that the T m for the 2'-0-methyl-substituted 

55 oligonucleotide duplex is higher than that for the corresponding RNA:2'-0-methyl-substituted oligonucleotide duplex, 
which is higher than the T m for the corresponding DNA:DNA or RNA:DNA duplex. 
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TABLE 1 



Solution Oligonucleotide Melting Experiments 
(+) = Target Sequence (5-CTGAACGGTAGCATCTTGAC- 

3')(SEQ ID NO:6)* 
(-) = Complementary Sequence (5'GTCAAGATGCTACCGT- 
TCAG-3')(SEQ ID NO:7)* 


Type of Oligonucleotide, 
Target Sequence (+) 


Type of Oligonucleotide, 
Complementary 
Sequence (+) 


T m (°C) 


DNA (+) 


DNA(-) 


61.6 


DNA(+) 


2'0Me(-) 


58.6 


2'0Me(+) 


DNA(-) 


61.6 


2'0Me(+) 


2'0Me(-) 


78.0 


RNA(+) 


DNA(-) 


58.2 


RNA(+) 


2'0Me(-) 


73.6 



* T refers to thymine for the DNA oligonucleotides, or uracil for 
the RNA oligonucleotides. 



Example 2: array hybridization experiments with DNA chips and oligonucleotide analogue targets 

A variable length DNA probe array on a chip was designed to discriminate single base mismatches in the 3 corre- 
sponding sequences 

5'-CTGAACGGTAGCATCTTGAC-3' (SEQ ID NO:6)(DNA target), 
5'-CUGAACGGUAGCAUCUUGAC-3' (SEQ ID NO:8)(RNA target) and 

5'-CUGAACGGUAGCAUCUUGAC-3' (SEQ ID NO:9)(2'-0-methyl oligonucleotide target), and generated by the * 
VLSIPS™ procedure. The Chip was designed with adjacent 1 2-mers and 8-mers which overlapped with the 3 target 
sequences as shown in Table 2. 



Table 2: Array hybridization Experiments 


Target 1 (DNA) 

8-mer probe (complement) 

12-mer probe (complement) 


5 ' -CTGAACGGTAGCATCTTGAC-3 ' (SEQ ID NO: 6) 


Target 2 (RNA) 

8-mer probe (complement) 

12-mer probe (complement) 


5'~CUGAACGGUAGCAUCUUGAC-3' (SEQ ID NO: 8) 


Target 3 (2'-0-Me oligo) 
8-mer probe (complement) 
12-mer probe (complement) 


5'-CUGAACGGUAGCAUCUUGAC-3' (SEQ ID NO: 9) 
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Target oligos were synthesized using standard techniques. The DNA and 2 , -0-methyl oligonucleotide analogue tar- 
get oligonucleotides were hybridized to the chip at a concentration of 10nM in 5x SSPE at 20°C in sequential experi- 
ments. Intensity measurements were taken at each probe position in the 8-mer and 12-mer arrays over time. The rate 
of increase in intensity was then plotted for each probe position. The rate of increase in intensity was similar for both 

5 targets in the 8-mer probe arrays, but the 12-mer probes hybridized more rapidly to the DNA target oligonucleotide. 

Plots of intensity versus probe position were generated for the RNA, DNA and 2-0 -methyl oligonucleotides to 
ascertain mismatch discrimination. The 8-mer probes displayed similar mismatch discrimination against all targets. The 
12-mer probes displayed the highest mismatch discrimination for the DNA targets, followed by the 2'-0-methyl target, 
with the RNA target showing the poorest mismatch discrimination. 

10 Thermal equilibrium experiments were performed by hybridizing each of the targets to the chip for 90 minutes at 
5°C temperature intervals. The chip was hybridized with the target in 5x SSPE at a target concentration of 10nM. Inten- 
sity measurements were taken at the end of the 90 minute hybridization at each temperature point as described above. 
All of the targets displayed similar stability, with minimal hybridization to the 8-mer probes at 30°C. In addition, all of the 
targets showed similar stability in hybridizing to the 12-mer probes. Thus, the 2'-0-methyl oligonucleotide target had 

is similar hybridization characteristics to DNA and RNA targets when hybridized against DNA probes. 

Example 3: 2'-0-methyl-substituted oligonucleotide chips 

DMT-protected DNA and 2'-0-methyl phosphoramidites were used to synthesize 8-mer probe arrays on a glass 
20 slide using the VLSIPS™ method. The resulting chip was hybridized to DNA and RNA targets in separate experiments. 
The target sequence, the sequences of the probes on the chip and the general physical layout of the chip is described 
in Table 3. 

The chip was hybridized to the RNA and DNA targets in successive experiments. The hybridization conditions used 
were 10nM target, in 5x SSPE. The chip and solution were heated from 20°C to 50°C, with a fluorescence measure- 
rs ment taken at 5 degree intervals as described in SN PCT/US94/123Q5. The chip and solution were maintained at each 
temperature for 90 minutes prior to fluorescence measurements. The results of the experiment showed that DNA 
probes were equal or superior to 2'-0-methyl oligonucleotide analogue probes for hybridization to a DNA target, but that 
the 2'-0-methyl analogue oligonucleotide probes showed dramatically better hybridization to the RNA target than the 
DNA probes. In addition, the 2'-0-methyl analogue oligonucleotide probes showed superior mismatch discrimination of 
30 the RNA target compared to the DNA probes. The difference in fluorescence intensity between the matched and mis- 
matched analogue probes was greater than the difference between the matched and mismatched DNA probes, dramat- 
ically increasing the signal-to-noise ratio. Figure 1 displays the results graphically (Figure 1A). (M) and (P) indicate 
mismatched and perfectly matched probes, respectively. Figure 1 B illustrates the fluorescence intensity versus location 
on an example chip for the various probes at 20°C using RNA and DNA targets. 

35 
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Table 3: 2'-0-methyl Oligonucleotide Analogues on a Chip. 



Target Sequence (DNA): 

Target Sequence (RNA): 

Matching DNA oligonucleotide probe {DNA 
(M)} 

Matching 2'-0-methyl oligonucleotide 
analogue probe {2'OMe (M)} 

DNA oligonucleotide probe with 1 base 
mismatch {DNA (P)} 

2 , -0-methyl oligonucleotide analogue probe 
with 1 base mismatch {2'OMe (M)} 



5^CTGAACGGTAGCATCTTGAC-3' 
(SEQ ID NO:6) 

5'-CUGAACGGUAGCAUCUUGAC-3' 
(SEQ ID NO:8) 

S'-CTTGCCAT (SEQ ID NO: 10) 
5'-CUUGCCAU (SEQ ID NO: 11) 
S'-CTTGCTAT (SEQ ID NO: 12) 
S'-CUUGCUAU (SEQ ID NO: 13) 



SCHEMATIC REPRESENTATION OF 2 , -0-METHYL/DNA CHIP 



Matching 2 , -0-methyl oligonucleotide analogue probe 
2'-0-methyl oligonucleotide analogue probe with 1 base mismatch 
DNA oligonucleotide probe with 1 base mismatch 
Matching DNA oligonucleotide probe 



Example 4: synthesis of oligonucleotide analogues 

The reagent MeNPoc-CI group reacts non-selectively with both the 5' and 3' hydroxyls on 2-O-methyl nucleoside 
analogues. Thus, to generate high yields of S'-O-MeNPoc^'-O-methylribonucleoside analogues for use in oligonucle- 
otide analogue synthesis, the following protection -deprotection scheme was utilized. 

The protective group DMT was added to the 5 -0 position of the 2'-0-methylribonucleoside analogue in the pres- 
ence of pyridine. The resulting 5 -O-DMT protected analogue was reacted with TBDMS-Triflate in THF. resulting in the 
addition of the TBDMS group to the 3'-0 of the analogue. The 5'-DMT group was then removed with TCAA to yield a 
free OH group at the 5* position of the 2'-0-methyl ribonucleosfcJe analogue, followed by the addition of MeNPoc-CI in 
the presence of pyridine, to yield S'-O-MeNPoc-S'-O-TBDMS-Z-O-methyl ribonucleoside analogue. The TBDMS group 
was then removed by reaction with NaF, and the 3'-0H group was phosphitylated using standard techniques. 

Two other potential strategies did not result in high specific yields of S'-O-MeNPoc^'-O-methylribonucleoside. In 
the first, a less reactive MeNPoc derivative was synthesized by reacting MeNPoc-CI with N-hydroxy succimide to yield 
MeNPoc-NHS. This less reactive photocleavable group (MeNPoc-NHS) was found to react exclusively with the 3' 
hydroxy! on the 2'-0-methylribonucleoside analogue. In the second strategy, an organotin protection scheme was used. 
Dibutyttin oxide was reacted with the 2'-0-methylribonucleoside analogue followed by reaction with MeNPoc. Both 5'- 
O-MeNPoc and 3'-0-MeNPoc 2'-0-methylribonucieoside analogues were obtained. 

Example 5: hybridization to mixed-sequence oligodeoxynucleotide probes substituted with 2-amino-2'<leoxyadenosme 
(D) 

To test the effect of a 2-amino-2'-deoxyadenosine (D) substitution in a heterogeneous probe sequence, two 4x4 oli- 
godeoxynucleotide arrays were constructed using VLSIPS™ methodology and 5'-0-MeNPOC-protected deoxynucleo- 



14 



EP0 742 287 A2 



side phosphoramidites. Each array was comprised of the following set of probes based on the sequence (3> 
CATCGTAGAA-(5') (SEQ ID NO: 1): 

1. -(HEG)-(3')-CATNiGTAGAA-(5') (SEQ ID NO:14) 
5 2. -(HEG)-(3>CATCN 2 TAGAA-(5') (SEQ ID NO:15) 

3. -(HEG)-(3>CATCGN 3 AGAA-(5') (SEQ ID NO:16) 

4. -(HEG)-(3>CATCGTfcUGAA-(5') (SEQ ID NO: 17) 

where HEG = hexaethyleneglycol linker, and N is either A.G.C or T, so that probes are obtained which contain single 
10 mismatches introduced at each of four central locations in the sequence. The first probe array was constructed with all 
natural bases. In the second array, 2-amino-2'-deoxyadenosine (D) was used in place of adenosine (A). Both arrays 
were hybridized with a 5'-f luorescein-labeled oligodeoxynucleotide target, (5>FI-d(CTGAACGGTAGCATCTTGAC)-(3') 
(SEQ ID NO:18), which contained a sequence (in bold) complementary to the base probe sequence. The hybridization 
conditions were: 10nM target in 5xSSPE buffer at 22°C with agitation. After 30 minutes, the chip was mounted on the 
15 f lowcell of a scanning laser confocal fluorescence microscope, rinsed briefly with 5xSSPE buffer at 22°C, and then a 
surface fluorescence image was obtained. 

The relative efficiency of hybridization of the target to the complementary and single-base mismatched probes was 
determined by comparing the average bound surface fluorescence intensity in those regions of the of the array contain* 
ing the individual probe sequences. The results (Figure 3) show that a 2-amino-2 , -deoxyadenosine (D) substitution in a 
20 heterogeneous probe sequence is a relatively neutral one, with little effect on either the signal intensity or the specificity 
of DNA-DNA hybridization, under conditions where the target is in excess and the probes are saturated. 

Example 6: hybridization to a dA-homopolymer oligodeoxynucleotide probe substituted with 2-amino-2'-deoxyadenos- 
ine (D) 

25 

The following experiment was performed to compare the hybridization of 2*-deoxyadenosine containing homopoly- 
mer arrays with 2-amino-2'-deoxyadenosine homopolymer arrays. The experiment was performed on two 1 1 -mer oligo- 
deoxynucleotide probe containing arrays. Two 11 -mer oligodeoxynucleotide probe sequences were synthesized on a 
chip using S'-O-MeNPOC-protected nucleoside phosphoramidites and standard VLSIPS™ methodology. 

30 The sequence of the first probe was: 

(HEG)-(3> d(AAAAANAAAAA)-(5') (SEQ ID NO:19); where HEG = hexaethyleneglycol linker, and N is either A.G.C or 
T. The second probe was the same, except that dA was replaced by 2-amino-2'-deoxyadenosine (D). The chip was 
hybridized with a S'-fluorescein-labeled oligodeoxynucleotide target, (5>FI-d(TTTTTGTTTTT)-(3 r ) (SEQ ID NO:20), 
which contained a sequence complementary to the probe sequences where N=C. Hybridization conditions were 1 0nM 

35 target in SxSSPE buffer at 22°C with agitation. After 15 minutes, the chip was mounted on the flowcell of a scanning 
laser confocal fluorescence microscope, rinsed briefly with 5xSSPE buffer at 22°C (low stringency), and a surface flu- 
orescence image was obtained. Hybridization to the chip was continued for another 5 hours, and a surface fluorescence 
image was acquired again. Finally, the chip was washed briefly with 0.5xSSPE (high-stringency), then with SxSSPE, 
and re-scanned. 

40 The relative efficiency of hybridization of the target to the complementary and single-base mismatched probes was 
determined by comparing the average bound surface fluorescence intensity in those regions of the of the array contain- 
ing the individual probe sequences. The results (Figure 4) indicate that substituting 2 , -deoxyadenosine with 2-amino-2'- 
deoxyadenosine in a d(A) n homopolymer probe sequence results in a significant enhancement in specific hybridization 
to a complementary oligodeoxynucleotide sequence. 

45 

Example 7: hybridization to alternating A~T oligodeoxynucleotide probes substituted with 5-propynyl-2'-deoxyuridine 
(P) and 2-amino-2'-deoxyadenosine (D) 

Commercially available 5'-DMT-protected 2'-deoxynucleoside/nucleoside-analog phosphoramidites (Glen 
so Research) were used to synthesize two decanucleotide probe sequences on separate areas on a chip using a modified 
VLSIPS™ procedure. In this procedure, a glass substrate is initially modified with a terminal-MeNPOC-protected hexa- 
ethyleneglycol linker. The substrate was exposed to light through a mask to remove the protecting group from the linker 
in a checkerboard pattern. The first probe sequence was then synthesized in the exposed region using DMT-phospho- 
ramidites with acid-deprotection cycles, and the sequence was finally capped with (MeO) 2 PNiPr 2 /tetrazole followed by 
55 oxidation. A second checkerboard exposure in a different (previously unexposed) region of the chip was then per- 
formed, and the second probe sequence was synthesized by the same procedure. The sequence of the first "control" 
probe was: -(HEG)-(3 , )-CGCGCCGCGC-(5 , ) (SEQ ID NO:21); and the sequence of the second probe was one of the 
following: 
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1. -(HEGM3^(ATATAATATA)(5*) (SEQ ID NO:22) 

2. -(HEGM3 , )<KAPAPAAPAPA)-(5') (SEQ ID NO:23) 

3. -(HEG)-(3')-d(DTDTDDTDTD)-(5') (SEQ ID NO:24) 

4. -(HEG)-(3')-d(DPDPDDPDPD)-(5') (SEQ ID NO:25) 

5 

where HEG = hexaethyleneglycol linker, A = 2'-deoxyadenosine, T = thymidine, D = 2-amino-2'-deoxyadenosine, and P 
= 5-propynyl-2'-deoxyuridine. Each chip was then hybridized in a solution of a f luorescein-labeled oligodeoxynucleotide 
target, (5>Fluorescein-d(TATATTATAT)-(HEG)-d(GCGCGGCGCG)-(3') (SEQ ID NO:26 and SEQ ID NO:27), which is 
complementary to both the A/T and G/C probes. The hybridization conditions were: 10nM target in SxSSPE buffer at 
10 22°C with gentle shaking. After 3 hours, the chip was mounted on the f lowcell of a scanning laser confocal fluorescence 
microscope, rinsed briefly with 5xSSPE buffer at 22°C, and then a surface fluorescence image was obtained. Hybridi- 
zation to the chip was continued overnight (total hybridization time = 20hr), and a surface fluorescence image was 
acquired again. 

The relative efficiency of hybridization of the target to the A/T and substituted A/T probes was determined by com- 
15 paring the average surface fluorescence intensity bound to those parts of the chip containing the A/T or substituted 
probe to the fluorescence intensity bound to the G/C control probe sequence. The results (Figure 5) show that 5-propy- 
nyl-dU and 2-amino-dA substitution in an A/T-rich probe significantly enhances the affinity of an oligonucleotide ana- 
logue for complementary target sequences. The unsubstituted A/T-probe bound only 20% as much target as the all- 
G/C-probe of the same length, while the D- & P-substituted A/T probe bound nearly as much (90%) as the G/C-probe. 
20 Moreover, the kinetics of hybridization are such that, at early times, the amount of target bound to the substituted A/T 
probes exceeds that which is bound to the all-G/C probe. 

Example 8: hybridization to oligodeoxynucleotide probes substituted with 7-deaza-2'-deoxyguanosine (ddG) and 2'- 
deoxyinosine (dl) 

25 

A 16x64 oligonucleotide array was constructed using VLSIPS™ methodology, with 5'-0-MeNPOC-protected nucle- 
oside phosphoramidites, including the analogs ddG, and dl. The array was comprised of the set of probes represented 
by the following sequence: 

30 -(linkerJ-CT-dlAIGIIG! G 2 G3G4G5CGGGD -(5'); (SEQ ID NO:28) where underlined bases are fixed, and 
the five internal deoxyguanosines (G^s) are substituted with G, ddG, dl, and T in all possible (1024 total) combina- 
tions. A complementary oligonucleotide target, labeled with fluorescein at the 5'-end: 

(5>FI- d(C A A T A C A A C C C C C G C C C A T C C) -(3*) (SEQ ID NO:29), was hybridized to the array. The 
hybridization conditions were: 5 nM target in 6 x SSPE buffer at 22°C with shaking. After 30 minutes, the chip was 
35 mounted on the f lowcell of an Affymetrix scanning laser confocal fluorescence microscope, rinsed once with 0.25 
x SSPE buffer at 22°C, and then a surface fluorescence image was acquired. 

The "efficiency" of target hybridization to each probe in the array is proportional to the bound surface fluorescence 
intensity in the region of the chip where the probe was synthesized. The relative values for a subset of probes (those 
40 containing dG->ddG id dG->dl substitutions only) are shown in Figure 6. Substitution of guanosine with 7-deazaguano- 
sine within the internal run of five G's results in a significant enhancement in the fluorescence signal intensity which 
measures hybridization. Deoxyinosine substitutions also enhance hybridization to the probe, but to a lesser extent. In 
this example, the best overall enhancement is realized when the dG run" is - 40-60% substituted with 7-deaza-dG, with 
the substitutions distributed evenly throughout the run (i.e., alternating dG / deaza-dG). 

45 

Example 9: Synthesis of 5- MeNPOC-2'-deoxyinosine-3' -(N t N-diisopropyl-2<yanoethyl)phosphoramidite 

2'-deoxyinosine (5.0g, 20 mmole) was dissolved in 50 ml of dry DMF, and 1 00 ml dry pyridine was added and evap- 
orated three times to dry the solution. Another 50ml pyridine was added, the solution was cooled to -20°C under argon, 

so and 13.8g (50 mmole) of MeNPOC-chloride in 20 ml dry DCM was then added dropwise with stirring over 60 minutes. 
After 60 minutes, the cold bath was removed, and the solution was allowed to stir overnight at room temperature. Pyri- 
dine and DCM were removed by evaporation, 500 ml of ethyl acetate was added, and the solution was washed twice 
with water and then with brine (200ml each). The aqueous washes were combined and back-extracted twice with ethyl 
acetate, and then all of the organic layers were combined, dried with Na 2 S0 4 , and evaporated under vacuum. The prod- 

55 uct was recrystallized from DCM to obtain 5.0g (50% yield) of pure 5'-0-MeNPOC-2 '-deoxyinosine as a yellow solid 
(99% purity, according to 1 H-NMR and HPLC analysis). 

The MeNPOC-nucleoside (2.5g, 5. 1 mmole) was suspended in 60 ml of dry CH 3 CN and phosphitylated with 2- cya- 
noethyl N.N.N'.N'-tetraisopropylphosphorodiamidite (1 .65 g / 1 .66 ml; 5.5 mmole) and 0.47 g (2.7 mmole) of diisopro- 
pylammonium tetrazolide, according to the published procedure of Barone, etaL (Nucleic Acids Res. (1984) 12, 4051- 
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61). The crude phosphoramidite was purified by flash chromatography on silica gel (90:8:2 DCM-MeOH-Et 3 N), co- 
evaporated twice with anhydrous acetonitrile and dried under vacuum for ~ 24 hours to obtain 2.8g (80%) of the pure 
product as a yellow solid (98% purity as determined by 1 H/ 31 P-NMR and HPLC). 

5 Example 10: synthesis of 5'-MeNPOC-7<ieaza-2'-deoxy(N2'isobutyryiyguanosw^ 
thyl)phosphoramidite. 

The protected nucleoside 7-deaza-2'-deoxy(N2- isobutyryl)guanosine (1.0g, 3 mmole; Chemgenes Corp., 
Waltham, MA) was dried by co-evaporating three times with 5 ml anhydrous pyridine and dissolved in 5 ml of dry pyri- 

10 dine-DCM (75:25 by vol.). The solution was cooled to -45oC (dry ice/CH 3 CN) under argon, and a solution of 0.9g (3.3 
mmole) MeNPOC-CI in 2 ml dry DCM was then added dropwise with stirring. After 30 minutes, the cold bath was 
removed, and the solution allowed to stir overnight at room temperature. The solvents were evaporated, and the crude 
material was purified by flash chromatography on silica gel (2.5% - 5% MeOH in DCM) to yield 1 .5g (88%yield) 5'- MeN- 
POC-7<leaza-2 , <leoxy(N2-isobutyryl)guanosine as a yellow foam. The product was 98% pure according to 1 H-NMR 

15 and HPLC analysis. 

The MeNPOC-nucleoside (1 .25g, 2.2 mmole) was phosphitylated according to the published procedure of Barone, 
et aL {Nucleic Acids Res. (1984) 12, 4051-61). The crude product was purified by flash chromatography on silica gel 
(60:35:5 hexane-ethyl acetate-EtaN), co-evaporated twice with anhydrous acetonitrile and dried under vacuum for -24 
hours to obtain 1 .3g (75%) of the pure product as a yellow solid (98% purity as determined by 1 H/ 31 P-NMR and HPLC). 

20 

Example 11: synthesis of 5'-MeNPOC-2,6-bis(phenoxyacetyl) -2,6-diaminopurine -2'-deoxyriboside-3'-(N,N-diisopro- 
pyl-2- cyanoethyl)phosphoramidite. 

The protected nucleoside 2,6-bis(phenoxyacetyl) -2,6-diaminopurine-2'-deoxyriboside (8 mmole, 4.2 g) was dried 
25 by coevaporating twice from anhydrous pyridine, dissolved in 2:1 pyridine/DCM (17.6 ml) and then cooled to -40 °C. 
MeNPOC-chloride (8 mmole, 2. 1 8 g) was dissolved in DCM (6.6mls) and added to reaction mixture dropwise. The reac- 
tion was allowed to stir overnight with slow warming to room temperature. After the overnight stirring, another 2 mmole 
(0.6 g) in DCM (1 .6 ml) was added to the reaction at -40 °C and stirred for an additional 6 hours or until no unreacted 
nucleoside was present. The reaction mixture was evaporated to dryness, and the residue was dissolved in ethyl ace- 
30 tate and washed with water twice, followed by a wash with saturated sodium chloride. The organic layer was dried with 
MgS0 4> and evaporated to a yellow solid which was purified by flash chromatography in DCM employing a methanol 
gradient to elute the desired product in 51% yield. 

The 5'-MeNPOC-nucleoside (4.5 mmole, 3.5 g) was phosphitylated according to the published procedure of Bar- 
one, era/. (Nucleic Acids Res. (1984) 12, 4051-61). The crude product was purified by flash chromatography on silica 
35 gel (99:0.5:0.5 DCM-MeOH-Et3N). The pooled fractions were evaporated to an oil, redissolved in a minimum amount of 
DCM, precipitated by the addition of 800 ml ice cold hexane, filtered, and then dried under vacuum for - 24 hours. 
Overall yield was 56%, at greater than 96% purity by HPLC and 1 H/ 31 P-NMR. 

Example 12: 5'-0-MeNPOC-protected phosphoramidites for incorporating 7-deaza-2'deoxyguanosine and 2'-deoxyino- 
40 sine into VLSSIPS™ Oligonucleotide Arrays 

VLSIPS oligonucleotide probe arrays in which all or a subset of all guanosine residues are substitutes with 7- 
deaza-2'-deoxyguanosine and/or 2 , -deoxyinosine are highly desirable. This is because guanine-rich regions of nucleic 
acids associate to form multi-stranded structures. For example, short tracts of G residues in RNA and DNA commonly 

45 associate to form tetrameric structures (Zimmermin et al. (1975) J. Mol. Biol. 92: 181 ; Kim, J. (1991) Nature 351 : 331 ; 
Sen etal. (1988) Nature 335: 364; and Sunquist era/. (1989) Nature 342: 825). The problem this poses to chip hybrid- 
ization-based assays is that such structures may compete or interfere with normal hybridization between complemen- 
tary nucleic acid sequences. However, by substituting the 7-deaza-G analog into G-rich nucleic acid sequences, 
particularly at one or more positions within a run of G residues, the tendency for such probes to form higher-order struc- 

so tures is suppressed, while maintaining essentially the same affinity and sequence specificity in double-stranded struc- 
tures. This has been exploited in order to reduce band compression in sequencing gels (Mizusawa, et al. ( 1 986) N.A.R. 
14: 1319) to improve target hybridization to G-rich probe sequences in VLSIPS arrays. Similar results are achieved 
using inosine (see also, Sanger et al. (1977) RN.A.S. 74: 5463). 

For facile incorporation of 7-deaza-2'-deoxyguanosine and 2*-deoxyinosine into oligonucleotide arrays using 

55 VLSIPS™ methods, a nucleoside phosphoramidite comprising the analogue base which has a S'-O'-MeNPOC-protect- 
ing group is constructed. This building block was prepared from commercially available nucleosides according to 
Scheme I. These amidites pass the usual tests for coupling efficiency and photolysis rate. 
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Although the foregoing invention has been described in some detail by way of illustration and example for purposes 
of clarity of understanding, modifications can be made thereto without departing from the spirit or scope of the 
25 appended claims. 

All publications and patent applications cited in this application are herein incorporated by reference for all pur- 
poses as if each individual publication or patent application were specifically and individually indicated to be incorpo- 
rated by reference. 
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SEQUENCE LISTING 



5 



(1) GENERAL INFORMATION: 



(i) 



APPLICANT: McGall, Glenn Hugh 

Miyada, Charles Garrett 



10 



Cronin, Maureen T. 
Tan, Jennifer Dee 
Chee, Mark 



(ii) TITLE OF INVENTION: Modified Nucleic Acid Probes 
(iii) NUMBER OF SEQUENCES : 29 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Hepworth Lawrence Bryer & Bizley 

(B) ADDRESS: Merlin House 



(C) POSTCODE: CM16 5DQ 

(D) COUNTRY: UK 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: Not yet assigned 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/440,742 

(B) FILING DATE: 10-MAY-1995 

(viii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: Not yet assigned 

(B) FILING DATE: 03-APR-1996 

(ix) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Bizley, Richard Edward 

(B) REFERENCE/DOCKET NUMBER: APEP96235 



20 



Falconry Court 
Bakers Lane 
Epping 
Essex 



50 



<X) 



TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (0)1992 561756 

(B) TELEFAX: (0)1992 561934 
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<ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
AAGATGCTAC 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
AAAAANAAAA A 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATATAATATA 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CGCGCCGCGC 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 6.-10 

(D) OTHER INFORMATION: /raod_base= OTHER 

/note« "n = guanosine (G) , 
2* , 3 '-dideoxyguanine (ddG), 
2'-deoxyinosine (dl) or thymine (T) " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TGGCCNNNNN TTGTA 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..20 

(D) OTHER INFORMATION: /note- "Target DNA sequence" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CTGAACGGTA GCATCTTGAC 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 1..20 

(D) OTHER INFORMATION: /note= "Complementary DNA sequence" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GTCAAGATGC TACCGTTCAG 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA 
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(ix) FEATURE:- 

<A) NAME/KEY: - 

( B ) LOCATION : 1 • • 20 

(D) OTHER INFORMATION: /note= "Target RNA sequence" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CUGAACGGUA GCAUCUUGAC 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "2 '-O-methyl oligonucleotide" 



(ix) FEATURE: 

(A) NAME /KEY: raodif ied_base 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /mod_base= cm 

(ix) FEATURE: 

(A) NAME /KEY: modif iedjaase 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /raod_baee= urn 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /mod_baae= gm 

(ix) FEATURE: 

(A) NAME /KEY: raodif iedjbase 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /mod_base« OTHER 

/note= "2 '-O-methyl adenosine" 

( ix ) FEATURE : 

(A) NAME/KEY: modif ied_baae 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note«» H 2 *-0-methyl adenosine" 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= cm 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /mod_base= gm 

( ix ) FEATURE : 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /mod_baee= gm 
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(ix) FEATURE: 

(A) NAME/KEY: raodif ied_baae 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_baee= urn 

(ix) FEATURE: 

(A) NAME/KEY: modified baae 
<B) LOCATION: 10 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= m 2 '-O-methy 1 adenosine" 

(ix) FEATURE: 

(A) NAME/KEY: modified baae 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /mod_baae* gm 

(ix) FEATURE: 

(A) NAME /KEY: modif iedjbaee 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /mod_baae= cm 

(ix) FEATURE: 

(A) NAME /KEY: modif ied_baae 

(B) LOCATION: 13 

<D) OTHER INFORMATION: /mod_baee= OTHER 

/note= "2'-0-methyladenoaine M 

(ix) FEATURE: 

(A) NAME/KEY: modified baae 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /mod_baee= urn 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_baee 
30 (B) LOCATION: IS 

(D) OTHER INFORMATION: /mod_baae= cm 

(ix) FEATURE: 

(A) NAME/KEY: modified baae 
<B) LOCATION: 16 
35 (D) OTHER INFORMATION: /mod_baae= urn 

(ix) FEATURE: 

(A) NAME/KEY: modified baae 

(B) LOCATION: 17 

(D) OTHER INFORMATION: /mod_baee= urn 

40 

(ix) FEATURE: 

(A) NAME /KEY: modified baae 

(B) LOCATION: 18 

<D) OTHER INFORMATION: /raod_baae= gm 

45 (ix) FEATURE: 

(A) NAME /KEY: modified baae 

(B) LOCATION: 19 

(D) OTHER INFORMATION: /mod_baae= OTHER 

/note= "2* -O-methy 1 adenosine" 

50 (ix) FEATURE: 

(A) NAME/KEY: modified baae 

(B) LOCATION: 20 " 

(D) OTHER INFORMATION: /mod baee= cm 



55 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..20 

(D) OTHER INFORMATION: /note- "Target 2'-0-methyl 

oligonucleotide sequence" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOx9: 
NNNNNNNNNN NNNNNNNNNN 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..8 

(D) OTHER INFORMATION: /note= "Matching DNA oligonucleotide 

probe" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTTGCCAT 



(2) INFORMATION FOR SEQ ID NO: 11: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid ' 

(C) STRANDEDNESS: single 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /deoc = "2'-0-methyl oligonucleotide" 



(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /raod_base= cm 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_baee 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /mod_base« urn 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /mod_base= urn 

( ix ) FEATURE : 

(A) NAME /KEY: modif iedjbase 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /mod_base= gm 
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(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /mod_base= cm 

(ix) FEATURE: 

(A) NAME/KEY: modified baee 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /modjDase= cm 

(ix) FEATURE: 

(A) NAME /KEY: modif iedbase 
<B) LOCATION: 7 

CD) OTHER INFORMATION: /mod_base= OTHER 

/note= "2 '-O-methy lade no sine" 

(ix) FEATURE: 
15 (A) NAME/KEY: modif ied_base 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /mod_base= urn 

(ix) FEATURE: 

(A) NAME/KEY: - 

20 (B) LOCATION: 1..8 

(D) OTHER INFORMATION: /note= "Matching 2'-0-methyl 

oligonucleotide analogue probe" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
25 NNNNNNNN 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 8 base paire 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 
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(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 1..8 

(D) OTHER INFORMATION: /note= "DNA oligonucleotide probe with 1 

base mismatch" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CTTGCTAT q 

(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "2 * -O-methyl oligonucleotide" 
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{ ix ) FEATURE : 

(A) NAME/KEY: modified base 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /mod_base= cm 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /mod_base= urn 

(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /mod_base= um 

(ix) FEATURE: 

(A) NAME/ KEY: modified baee 

(B) LOCATION: 4 " 

(D) OTHER INFORMATION: /mod_baae= gm 

(ix) FEATURE: 

(A) NAME /KEY: modified base 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /mod_base= cm 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= um 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /raodjbase- OTHER 

/note= ^'-O-raethyladenosine* 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /mod_base« um 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..8 

(D) OTHER INFORMATION: /note- "2 '-O-methyl oligonucleotide 

analogue probe with 1 base mismatch" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
NNNNNNNN 

45 

(2) INFORMATION FOR SEQ ID NO: 14: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



55 
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(ix) FEATURE; 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /mod_baae= OTHER 

/note- "N « cytosine covalently 
modified at the 3' phoaphate group with 
a hexaethyleneglycol (HEG) linker" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AAGATGNTAN 10 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

( B) LOCATION: 10 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = cytosine covalently modified 
at the 3' phosphate group with a 
hexaethyleneglycol (HEG) linker" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
AAGATNCTAN 10 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS x single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = cytosine covalently modified 
at the 3' phosphate group with a 
hexaethyleneglycol (HEG) linker* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AAGANGCTAN 10 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: eingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = cytoaine covalently modified 
at the 3 ' phosphate group with a 
hexaethyleneglycol (HEG) linker" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
20 AAG NTGCTAN 10 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 20 base pairs 

< B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



40 



45 



50 



(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /mod_base«= OTHER 
35 /note= "N = cytosine covalently modified 

at the 5' phosphate group with a 
fluorescein molecule" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
NTGAACGGTA GCATCTTGAC 20 

(2) INFORMATION FOR SEQ ID NO: 19 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /mod_base* OTHER 

/note= "N = adenine covalently modified 
at the 3' phosphate group with a 
hexaethyleneglycol (HEG) linker" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AAAAANAAAA N v 11 



(2) INFORMATION FOR SEQ ID NO: 20: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

<A) NAME/KEY: modif ied_base 
(B) LOCATION: 1 
25 (D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N as thymine covalently modified 
at the 5' phosphate group with a 
fluorescein molecule" 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

NTTTTGTTTT T 11 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



45 



(ix) FEATURE: 

(A) NAME /KEY: modified base 

(B) LOCATION: 10 

(D) OTHER INFORMATION 



/modjDase= OTHER 
/note* "N = cytosine covalently modified 
at the 3' phosphate group with a 
hexaethyleneglycol (HEG) linker" 



50 (xi) SEQUENCE DESCRIPTION : SEQ ID NO:21: 

CGCGCCGCGN 



10 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - w 2 ' -deoxy nucleoside/nucleoside 
10 analogue decanucleotide probe" 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 1 

15 (D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = 2 ' -deoxyadenosine- 
tix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = 2 ' -deoxyadenosine" 



20 



(ix) FEATURE: 

(A) NAME /KEY: modified base 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /mod__base= OTHER 
25 /note= "N = 2 ' -deoxyadenosine" 

( ix ) FEATURE : 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /raod_base= OTHER 
30 /note= " N = 2 ' -deoxyadenosine" 

(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /mod_base= OTHER 
3S /note= "N = 2 ' -deoxyadenoaine" 

(ix) FEATURE: 

(A) NAME /KEY: modif iedbase 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= *N = 2 ' -deoxyadenosine covalently 
modified at the 3' phosphate group with 
a hexaethyleneglycol (HEG) linker" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
45 NTNTNNTNTN 10 



40 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /deoc = "2 ' -deoxynucleoeide/nucleoside 
analogue decanucleotide probe" 

5 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N - 2 '-deoxyadenosine* 

10 

(ix) FEATURE: 

(A) NAME/KEY: modified baee 
<B) LOCATION: 2 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = 5-propynyl-2 ' -deoxyuridine" 

' 5 (ix) FEATURE: 

(A) NAME /KEY: modif iedbase 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = 2 ' -deoxy adenosine " 

20 (ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= M N = 5-propynyl-2 '-deoxyuridine" 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "M « 2 '-deoxyadenosine" 

(ix) FEATURE: 
30 (A) NAME/KEY: modified base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = 2 '-deoxyadenosine" 

(ix) FEATURE: 
3S (A) NAME /KEY: modif ied_base 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /mod_baae= OTHER 

/note= "N «= 5-propynyl-2 '-deoxyuridine" 

(ix) FEATURE: 

(A) NAME/KEY: modified base 
40 (B) LOCATION: 8 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = 2 '-deoxyadenosine" 

( ix ) FEATURE : 

(A) NAME/KEY: modif ied^base 
45 (B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = 5-propyny 1-2 '-deoxyuridine" 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 10 

50 (D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = 2 '-deoxyadenosine covalently 
modified at the 3' phosphate group with 
a hexaethyleneglycol (HEG) linker* 1 
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(xi) SEQUENCE DESCRIPTION 2 SEQ ID NO: 23: 
NNNNNNNNNN 10 

5 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /deec = "2 ' -deoxynucleoside/nucleoside 
15 analogue decanucleotide probe" 



(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 1 

20 (D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = 2-amino-2 ' -deoxyadenosine" 

(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 3 

25 (D) OTHER INFORMATION: /mod_base= OTHER 

/note= M N = 2-amino-2 ' -deoxyadenosine" 

(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 5 

30 (D) OTHER INFORMATION: /mod_base= OTHER 

/note= «n s 2-amino-2' -deoxyadenosine" 

(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 6 

35 (D) OTHER INFORMATION: /mod_base= OTHER 

/note** "N = 2 -amino-2 '-deoxyadenosine" 

(ix) FEATURE: 

(A) NAME /KEY: modif iedjbase 

(B) LOCATION: 8 

40 (D) OTHER INFORMATION : /mod_base= OTHER 

/note= "N = 2-amino-2 '-deoxy adenosine" 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 10 

45 (D) OTHER INFORMATION: /mod_base= OTHER 

/note= B N = 2 -amino-2 '-deoxyadenosine 
covalently modified at the 3' 
phosphate group with a 
hexaethyleneglycol (HEG) linker" 



50 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
NTNTNNTNTN 10 
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(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /deac » n 2 '-deoxynucleoside/nucleoside 
analogue decanucleotide probe" 



(ix) FEATURE: 

(A) NAME/KEY: modif iedjbase 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= " N = 2-amino-2 ' -de oxy adenosine" 

<ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = 5-propynyl-2 ' -deoxyuridine" 

(ix) FEATURE: 

(A) NAME/KEY: modif iedbaae 

(B) LOCATION: 3 

<D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = 2-amino-2 ' -deoxyadenosine" 

(ix) FEATURE : 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /mod_baee= OTHER 

/note= "N = 5-propynyl-2 '-deoxyuridine" 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = 2-amino-2 ' -deoxyadenosine" 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_baee 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_baBe= OTHER 

/note= "N = 2 -araino-2 '-deoxyadenosine* 

(ix) FEATURE: 

(A) NAME/KEY: modif iedbase 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /mod_baee= OTHER 

/note= "N = 5 -propyny 1-2 '-deoxyuridine" 

(ix) FEATURE: 

(A) NAME/KEY: modif iedbase 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /mod_baoe= OTHER 

/note= "N ~ 2-amino-2 '-deoxyadenosine" 

( ix ) FEATURE : 

(A) NAME/KEY: modif iedbase 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = 5-propynyl-2 '-deoxyuridine" 
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(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 10 

(D) OTHER INFORMATION: 



/raod_base= OTHER 

/note= "N = 2-amino-2 ' -deoxy adenosine 
covalently modified at the 3' 
phosphate group with a 
hexaethylenegiycol (HEG) linker" 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
NNNNNNNNNN 



10 



15 



20 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME / KEY : modif ied_base 
25 (B) LOCATION: 1 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = thymine covalently modified 
at the 5 ' hydroxyl group with a 
fluorescein molecule w 

30 (ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /raod^base^ OTHER 

/note= W N = thymine covalently modified 
at the 3 ' phosphate group with a 
35 hexaethylenegiycol (HEG) linker which is 

covalently bound to the 5' phosphate 
group of the 5' guanine (N in pos. 1) of 
SEQ ID NO:27" 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

NATATTATAN 10 



(2) INFORMATION FOR SEQ ID NO:27: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 1 
(D) OTHER INFORMATION 



10 



/mod_base= OTHER 
/note= "N = guanine covalently modified 
at the 5 ' phosphate group with a 
hexaethyleneglycol (HEG) linker which is 
covalently bound to the 3' phosphate 
group of the 3' thymine <N in pos. 10) 
Of SEQ ID NO: 26" 



15 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
NCGCGGCGCG 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: DNA 



25 (ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 6.. 10 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= W N = guanine (G) , 
2 ' , 3 ' -dideoxy guanine (ddG ) , 
2 '-deoxyinosine (dl) or thymine (T)* 

( ix ) FEATURE : 

(A) NAME /KEY: modif ied_base 

(B) LOCATION i 15 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= M N = cytosine covalently modified 
at the 5 ' phosphate group with a 
hexaethyleneglycol (HEG) linker" 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

TGGGCNNNNN TTGTN 15 



30 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 
(8) LOCATION: 1 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = cytooine covalently modified 
at the 5' phosphate group with a 
fluorescein molecule" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
NAATACAACC CCCGCCCATC C 21 



75 

Claims 

1 . A composition comprising an array of oligonucleotide analogues attached to a solid substrate. 
20 2. A composition of claim 1, wherein either:- 

(a) said array of oligonucleotide analogues comprises a nucleoside analogue with the formula 



25 



30 




wherein: 

35 

R 1 is selected from hydrogen, methyl, hydroxy, alkoxy, alkylthio, halogen, cyano, and azido; 
R 2 is selected from hydrogen, methyl, hydroxy, alkoxy, alkylthio, halogen, cyano, and azido; and 
Y is a heterocylic moiety; or 

40 (b) said array of oligonucleotide analogues comprises a nucleoside analogue with the formula 



45 




50 

wherein: 

R 1 is selected from hydrogen, hydroxyl, methyl, methoxy, ethoxy, propoxy, allyloxy, propargyloxy, Fluorine, 
Chlorine, and Bromine; 

55 R 2 is selected from hydrogen, hydroxyl, methyl, methoxy, ethoxy, propoxy, allyloxy, propargyloxy, Fluorine, 

Chlorine, and Bromine; and 

Y is a base selected from purines, purine analogues, pyrimidines, pyrimidine analogues, 3-nitropyrrole and 
5-nitroindole. 
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3. A composition of claim 1 , wherein said array of oligonucleotide analogues comprises a nucleotide with a base 
selected from 7<Jeazaguanosine, 2-aminopurine, 8-aza-7-deazaguanosine, 1H-purine, and hypoxanthine. 

4. A composition comprising an oligonucleotide analogue array synthesised on a solid substrate, wherein said syn- 
thesis is performed by light-directed chemical coupling or by flowing oligonucleotide analogue reagents over pre- 
determined regions of the solid substrate; optionally wherein said solid substrate is derivitised with a silane reagent 
prior to synthesis of said oligonucleotide analogue. 

5. A method of improving the hybridisation of a nucleic acid to an oligonucleotide array, comprising incorporating a 
base selected from 7-deazaguanosine, 2-aminopurine, 8-aza-7-deazaguanosine, 1H-purine. and hypoxanthine 
into the oligonucloetides of the array; optionally wherein the oligonucleotide is a homopolymer. 

6. A method of determining if a target molecule is complementary to a probe, comprising the steps of: 

(a) synthesising an oligonucleotide analogue array on a solid substrate; 

(b) exposing said oligonucleotide analogue array on said solid substrate to a target nucleic acid, optionally after 
amplification of said target nucleic acid; and 

(c) determining whether an oligonucloetide analogue member of said oligonucleotide analogue array binds to 
said target oligonucleotide. 

7. A method of daim 6 wherein said target nucleic acid is selected from genomic DN A, cDNA, unspliced RNA, mRNA, 
and rRNA. 

8. A composition comprising an array of oligonucleotide probes hybridised to a target nucleic acid, which target 
nucleic acid comprises a nucleotide analogue; optionally wherein the target nucleic acid is a PCR amplicon. 

9. A method of detecting a target nucleic acid, comprising enzymatically copying the target nucleic acid using nucle- 
otides which comprise a nucleotide analogue, thereby producing a nucleic acid analogue amplicon, and hybridising 
the nucleic acid amplicon to an oligonucleotide array. 

1 0. A method of claim 9, wherein the oligonucleotide array comprises an oligonucleotide analogue probe which is com- 
plementary to the nucleic acid analogue amplicon. 
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Figure 2 
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adenine vs. 2,6-diaminopurine in 
3'-CATCGTAGAA-5' 
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Figure 3 
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Substitution A->D in AAAAANAAAAA 
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Figure 4 
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Effect of A->0 and T->P substitution on hybridization to 
ATATAATATA (open) vs. CGCGCCGCGC (solid); hybridization 
time s 3 hr & 20 hr @ 22°C 
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Figure 5 



42 



EP 0 742 287 A2 



FIGURE C: Effect of dl & 7-deaza-dG substitution in 3' 
ATGTT(G1 G2G3G4G5)CGGGT-5' 
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